On the statistical significance of nucleic acid similarities

Authors

  • David J. Lipman
  • W. John Wilbur
  • Temple F. Smith
  • Michael S. Waterman
Abstract

When evaluating sequence similarities among nucleic acids by the usual methods, statistical significance is often found when the biological significance of the similarity is dubious. We demonstrate that the known statistical properties of nucleic acid sequences strongly affect the statistical distribution of similarity values when calculated by standard procedures. We propose a series of models which account for some of these known statistical properties. The utility of the method is demonstrated in evaluating high relative similarity scores in four specific cases in which there is little biological context by which to judge the similarities. In two of the cases we identify the statistical properties which are responsible for the apparent similarity. In the other two cases the statistical significance of the similarity persists even when the known statistical properties of sequences are modelled. For one of these cases biological significance is likely while the other case remains an enigma.
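As a rough illustration of the kind of test the abstract describes, the sketch below compares an observed similarity score with scores obtained from shuffled sequences that keep the same base composition. It is not the authors' procedure: the abstract does not say which statistical properties their models preserve, so base composition is an assumption here, as are the scoring values (`match`, `mismatch`, `gap`), the local-alignment scorer, and the function names `local_similarity`, `shuffled`, and `empirical_p_value`.

```python
import random


def local_similarity(a, b, match=1, mismatch=-1, gap=-2):
    """Best local alignment score with a linear gap penalty.

    The scoring values are illustrative assumptions, not taken from the paper.
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A local alignment may restart anywhere, so scores never drop below 0.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


def shuffled(seq, rng):
    """Random permutation of seq; base composition is preserved exactly."""
    chars = list(seq)
    rng.shuffle(chars)
    return "".join(chars)


def empirical_p_value(a, b, trials=200, seed=0):
    """Fraction of shuffled pairs scoring at least as high as the real pair."""
    rng = random.Random(seed)
    observed = local_similarity(a, b)
    hits = sum(
        local_similarity(shuffled(a, rng), shuffled(b, rng)) >= observed
        for _ in range(trials)
    )
    # Add-one correction keeps the estimate strictly above zero.
    return observed, (hits + 1) / (trials + 1)
```

A null model that preserves more structure than base composition would replace `shuffled` with a generator that keeps that structure. If the estimated p-value stays small under every such model, the similarity is not explained by the properties modelled, which is the situation the abstract reports for two of its four cases.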

Similar resources

The statistical distribution of nucleic acid similarities.

All pairs of a large set of known vertebrate DNA sequences were searched by computer for most similar segments. Analysis of this data shows that the computed similarity scores are distributed proportionally to the logarithm of the product of the lengths of the sequences involved. This distribution is closely related to recent results of Erdos and others on the longest run of heads in coin tossi...

Retrieval accuracy, statistical significance and compositional similarity in protein sequence database searches

Protein sequence database search programs may be evaluated both for their retrieval accuracy--the ability to separate meaningful from chance similarities--and for the accuracy of their statistical assessments of reported alignments. However, methods for improving statistical accuracy can degrade retrieval accuracy by discarding compositional evidence of sequence relatedness. This evidence may b...

Statistical and Practical Significance of Articles at Sports Biomechanics Conferences

Background. The importance of statistical approaches has increased, and they have become necessary for researchers and specialists in sports biomechanics, who need more objective and accurate methods to increase knowledge. Objectives. To evaluate how practical significance is actually used in articles published at scientific conferences on sports biomechanics. Methods. One hundred twe...

Cellular Morphology and Immunologic Properties of Escherichia coli Treated With Antimicrobial Antisense Peptide Nucleic Acid

Background & Objectives: Antisense peptide nucleic acids (PNA) that target growth-essential genes show potent bactericidal properties without cell lysis. We considered whether PNA treatment influences the total nucleic acid content of the bacteria and applied this approach to develop a new delivery system to dendritic cells (DCs). DCs are the most potent antigen presenting cells in th...

Journal:
  • Nucleic Acids Research

Volume 12, 1 Pt 1

Publication date: 1984